Japanese Case Frame Construction by Coupling the Verb and its Closest Case Component
نویسندگان
چکیده
This paper describes a method to construct a case frame dictionary automatically from a raw corpus. The main problem is how to handle the diversity of verb usages. We collect predicate-argument examples, which are distinguished by the verb and its closest case component in order to deal with verb usages, from parsed results of a corpus. Since these couples multiply to millions of combinations, it is difficult to make a wide-coverage case frame dictionary from a small corpus like an analyzed corpus. We, however, use a raw corpus, so that this problem can be addressed. Furthermore, we cluster and merge predicate-argument examples which does not have different usages but belong to different case frames because of different closest case components. We also report on an experimental result of case structure analysis using the constructed case frame dictionary.
منابع مشابه
An Empirical Architecture for Verb Subcategorization Frame - a Lexicon for a Real-world Scale Japanese-English Interlingual MT
The verb subcategorization frame information plays a major role of disambiguations in many NLP applications. Japanese, however, imposes difficulties of subcategorizing in part because it allows arbitrary ellipses of case elements. We propose a new type of verb subcategorization frame code set that combines the verb's surface case set and the deep case set, as a solution to the difficulties of e...
متن کاملAn Empirical Architecture for Verb Subcategorization Frame - a Lexicon for a Real-world Scale Japanese-English Interlingual MT
The verb subcategorization frame information plays a major role of disambiguations in many NLP applications. Japanese, however, imposes difficulties of subcategorizing in part because it allows arbitrary ellipses of case elements. We propose a new type of verb subcategorization frame code set that combines the verb's surface case set and the deep case set, as a solution to the difficulties of e...
متن کاملرشد جنبه معنایی فعل در کودک فارسیزبان: مطالعه طولی
Objective Learning “verb” as one of the main components of sentence, has been always a debatable topics in the process of language learning. One of the important issues in “verb” learning is determining its meaning using syntactic clues and learning its semantic aspects. Therefore, the main objective of this study was to examine the development of the semantic aspect of ...
متن کاملNote on Japanese Epistemic Verb Constructions: A Surface-Compositional Analysis
This paper offers a new analysis of the raising to object construction in Japanese. This has been extensively discussed following Kuno (1976) for the case where the matrix predicate is an epistemic verb. Under CCG analysis an o-marked phrase is a surfacecompositional object rather than a raised argument. This new approach correctly predicts the thetic and categorical judgments of epistemic verb...
متن کاملJapanese Case Structure Analysis
In Japanese, case structure analysis is very imt)ortant to handle several t roublesome characteristics of Japanese snch as scrambling, onfission of ease components, mid disappearance of case markers. However, fi)r lack of a widecoverage ease frame dictionary, it has been difficult to perfornl case structure analysis accurat;ely. Although several methods to construct a ease fl'mne dictionary fro...
متن کامل